Transcoder

ABSTRACT

A transcoder that controls the amount of generated codes of an output stream toward a target bit rate without degradation of image quality. The transcoder decodes a first stream and encodes the decoded image again to thereby output a second stream. The transcoder calculates a distortion evaluation value from the image obtained by decoding the first stream and an image obtained by decoding the second stream. Assuming that a ratio between the distortion evaluation value and a total target distortion evaluation value is determined as a target distortion ratio, a target setting bit rate of a second stream in the period can be obtained by multiplying a total target bit rate of the second stream by the target distortion ratio. Alternatively, the target setting bit rate can be obtained by adjusting the target distortion ratio with an appropriate function and adding the target distortion ratio to the total target bit rate of the second stream.

TECHNICAL FIELD

The present invention relates to a transcoder for converting an input stream into a different output stream by decoding, and more particularly to a technique to appropriately control the amount of generated codes of the output stream toward a target bit rate.

BACKGROUND ART

Images to be delivered on digital broadcasts, those to be stored in media such as DVDs and hard disks, and the like are compressed by various coding systems. The object for such compressions is to avoid constraint on a transmission band, increase the transmission speed, decrease the memory size, or the like.

There are various standards for image coding system, such as MPEG2 and H.264. And there is a case where the coding system is converted for the purpose of reducing the amount of codes in a coded image that is inputted, or the like. A transcoder once decodes the inputted coded image. Then, the transcoder encodes the decoded image by a different coding system (or the same coding system) again. Thus, the transcoder controls a bit rate of an output stream.

The following Patent Document 1 relates to a transcoder for converting an input stream compressed by a first compressive coding system into an output stream compressed by a second compressive coding system. This transcoder acquires the degree of dispersion of an image and the degree of variation of the image in a time direction during the decoding of the input stream compressed by the first compressive coding system. By using these values as evaluation values, the transcoder calculates a quantization step value to be used for the second compressive coding.

[Patent Document 1] Japanese Patent Application Laid Open Gazette No. 2008-42426

The coded image conversion technique disclosed in Patent Document 1 aims to obtain an image of high quality from the output stream while controlling the bit rate to be a target value. In other words, a method of simply controlling the bit rate to be constant regardless of the content of an image has a possibility of causing degradation of image quality due to uniform bit allocation. By adopting the method disclosed in Patent Document 1, a large number of bits are allocated to an image which needs more bits to be allocated thereto and no superfluous bits are allocated to another image which does not need more bits to be allocated thereto.

Thus, in the technique for calculating an optimal quantization step value on the basis of information on the image obtained from the input stream, the optimal bit allocation is basically predicted from the condition of the input stream. Therefore, it can be expected to achieve further optimization if the condition of the image of the actually generated output stream is also taken into consideration.

DISCLOSURE OF INVENTION

The present invention is intended for a transcoder converting a first stream into a second stream. According to an aspect of the present invention, the transcoder comprises a part for setting a target bit rate of a whole second stream, and a target bit rate determination part for determining a target bit rate of a second stream in the (N+1) period on the basis of the target bit rate of the whole second stream, a distortion evaluation value of a second stream converted in or before the N period, and a target distortion evaluation value of the whole second stream.

It thereby becomes possible to optimally control the bit rate of the second stream toward the target bit rate without degradation of image quality.

According to another preferred embodiment of the present invention, the transcoder of the present invention comprises a part for setting a target bit rate of a whole second stream, and a target bit rate determination part for determining a target bit rate of a second stream in the (N+1) period on the basis of the target bit rate of the whole second stream, a bit rate and a distortion evaluation value of a second stream converted in or before the N period, a target bit rate of a second stream in the N period, and a target distortion evaluation value of the whole second stream.

Therefore, it is an object of the present invention to appropriately control the amount of generated codes of an output stream toward a target bit rate while maintaining high quality of the image of the output stream.

These and other objects, features, aspects and advantages of the present invention will become more apparent from the following detailed description of the present invention when taken in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF DRAWINGS

[FIG. 1] is a block diagram showing a transcoder;

[FIG. 2] is a view showing information on an input stream (first stream) and an output stream (second stream) per control unit time;

[FIG. 3] is a view showing a characteristic of a function θ₁;

[FIG. 4] is a view showing a characteristic of a function θ₂;

[FIG. 5] is a view showing a characteristic of a function θ₃;

[FIG. 6] is a view showing a characteristic of a function θ₄;

[FIG. 7] is a view showing a characteristic of a function θ₅;

[FIG. 8] is a view showing a characteristic of a function θ₆; and

[FIG. 9] is a block diagram showing a transcoder in accordance with a seventh preferred embodiment.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

Hereinafter, with reference to figures, the preferred embodiments of the present invention will be discussed. FIG. 1 is a block diagram showing a transcoder 1 in accordance with the preferred embodiments. The transcoder 1 comprises a CPU 11, a first decoder 12, an image memory 13, a step value calculation part 14, an encoder 15, a buffer memory 16, a second decoder 17, and a distortion calculation part 18.

The first decoder 12 inputs a first stream. The first stream is a stream of coded image. The first decoder 12 decodes the first stream and outputs uncompressed image data to the image memory 13. The encoder 15 again codes the uncompressed image data which is decoded and stored in the image memory 13, to thereby output a second stream.

The transcoder 1 converts a coding system of stream, and for example, inputs a first stream coded in MPEG2 and outputs a second stream coded in H.264. The present invention is devised in order to optimally control the rate of the second stream to be outputted in the conversion. Alternatively, the transcoder 1 outputs a stream of the same coding system, and for example, inputs a first stream coded in MPEG2 and outputs a second stream recoded in MPEG2. Also in this case, the present invention is devised in order to optimally control the rate of the second stream to be outputted.

Various computations performed in the first decoder 12, the step value calculation part 14, the encoder 15, the second decoder 17, and the distortion calculation part 18 may be implemented by hardware or may be implemented by software operations. In other words, these processing parts may be constructed as hardware circuits or implemented by a CPU and programs stored in memories. Alternatively, there may be a case where some of the computations are performed by hardware and the others are performed by software.

FIG. 2 is a view showing information on streams that the transcoder 1 inputs or outputs per control unit time. The transcoder 1 divides the processing time base into control unit times L_(n) (n=1, 2 . . . ) and performs a rate control by using this control unit time L_(n) as a unit of processing. In the following discussion, the control unit time L_(n) is referred to as “the n period” as appropriate. As to the control unit time L_(n), one frame, a plurality of successive frames, one GOP, a plurality of successive GOPs, or the like may be set as one unit time.

A total input bit rate S of the first stream is acquired from a sequence header or the like. An average input bit rate S_(n) is an average bit rate of the first stream in the n period. The transcoder 1 can store information on the average input bit rates S_(n) for M periods in the buffer memory 16. Specifically, the buffer memory 16 can store information on the average input bit rates S_(n) from the (n−M+1) period to the n period. An average period bit rate AS_(n) is an average value of the average input bit rates S_(n) from the (n−M+1) period to the n period. The average period bit rate AS_(n) is expressed by Eq. (1).

$\begin{matrix} {{AS}_{n} = {\frac{1}{M}{\sum\limits_{n - M + 1}^{n}S_{i}}}} & (1) \end{matrix}$

A total target bit rate T of the second stream is set by a user. For example, the user uses a not-shown operation part provided in the transcoder 1 to set the total target bit rate T.

A target setting bit rate T_(n) is a target bit rate of the second stream in the n period. An average output bit rate C_(n) is an average bit rate of the second stream which is already converted in the n period. The transcoder 1 can store information on the average output bit rates C_(n) for M periods in the buffer memory 16. Specifically, the buffer memory 16 can store information on the average output bit rates C_(n) from the (n−M+1) period to the n period. An average period bit rate AC_(n) is an average value of the average output bit rates C_(n) from the (n−M+1) period to the n period. The average period bit rate AC_(n) is expressed by Eq. (2). Though the buffering period used for calculation of the average period bit rate AS_(n) or AC_(n) is linked to the control unit time L_(n) in the preferred embodiments, setting of the buffering period is not limited to this case. For example, one past frame, a plurality of successive past frames, one past GOP, a plurality of successive past GOPs, or the like at the point of time when the coding is completed may be set as the buffering period.

$\begin{matrix} {{AC}_{n} = {\frac{1}{M}{\sum\limits_{n - M + 1}^{n}C_{i}}}} & (2) \end{matrix}$

A total target distortion evaluation value d of the second stream is set by the user. For example, the user uses the not-shown operation part provided in the transcoder 1 to set the total target distortion evaluation value d.

An average distortion evaluation value d_(n) is an evaluation value indicating the degree of image distortion of the second stream which is already converted in the n period. The transcoder 1 can store information on the average distortion evaluation values d_(n) for M periods in the buffer memory 16. Specifically, the buffer memory 16 can store information on the average distortion evaluation values d_(n) from the (n−M+1) period to the n period. An average period distortion evaluation value Ad_(n) is an average value of the average distortion evaluation values d_(n) from the (n−M+1) period to the n period. The average period distortion evaluation value Ad_(n) is expressed by Eq. (3).

$\begin{matrix} {{Ad}_{n} = {\frac{1}{M}{\sum\limits_{n - M + 1}^{n}d_{i}}}} & (3) \end{matrix}$

A quantization step conversion factor α_(n) is a factor calculated at the point of time when the (n−1) period is finished. A quantization step value Q of the second stream is determined by multiplying a value P, which is a quantization step value of the first stream or a value calculated from the quantization step value of the first stream, by the quantization step conversion factor α_(n). This relation is expressed by Eq. (4).

Q−α_(n)P   (4)

Referring back to FIG. 1, the CPU 11 acquires a bit rate s_b for each picture of the first stream from the first decoder 12 and stores the bit rate s_b into the buffer memory 16. The CPU 11 calculates the average input bit rate S_(n) and the average period bit rate AS_(n) in the n period on the basis of the bit rate s_b stored in the buffer memory 16.

The first decoder 12 outputs the quantization step value P for each macroblock of the first stream to the step value calculation part 14.

The CPU 11 acquires a bit rate c_b for each picture of the converted second stream from the encoder 15 and stores the bit rate c_b into the buffer memory 16. The CPU 11 calculates the average input bit rate C_(n) and the average period bit rate AC_(n) in the n period on the basis of the bit rate c_b stored in the buffer memory 16.

The CPU 11 acquires the total target bit rate T and the total target distortion evaluation value d which are set by the user.

The encoder 15 encodes the uncompressed image data on the basis of the quantization step value Q inputted from the step value calculation part 14 to thereby generate the second stream. The second decoder 17 inputs the second stream encoded by the encoder 15 and decodes the second stream again to obtain the uncompressed image data. The distortion calculation part 18 inputs the uncompressed image data from each of the image memory 13 and the second decoder 17 and calculates a distortion evaluation value dI for each picture of the second stream.

For example, the distortion calculation part 18 compares the uncompressed image data acquired from the image memory 13 with the uncompressed image data acquired from the second decoder 17 and calculates a sum of squares of pixel difference values of these images, to thereby calculate the distortion evaluation value dI for each picture. Alternatively, the distortion evaluation value dI may be calculated by using a sum of absolute values of the pixel difference values of these images.

The CPU 11 inputs the distortion evaluation value dI for each picture from the distortion calculation part 18 and stores the distortion evaluation value dI into the buffer memory 16. The CPU 11 calculates the average distortion evaluation value d_(n) and the average period distortion evaluation value Ad_(n) from the distortion evaluation value dI stored in the buffer memory 16.

In the preferred embodiments, the smaller the distortion evaluation value dI is, the smaller the distortion of the image is. Therefore, also as to the average distortion evaluation value d_(n) and the average period distortion evaluation value Ad_(n), the smaller the value thereof is, the smaller the distortion of the image is. In the calculation of the distortion evaluation value dI, a weighting in accordance with a color component may be given.

The CPU 11 performs computations shown in the first to sixth preferred embodiments discussed later by using information on the total target bit rate T, the total target distortion evaluation value d, the average distortion evaluation value d_(n), the average period distortion evaluation value Ad_(n), the average output bit rate C_(n), the average period bit rate AC_(n), the average input bit rate S_(n), the average period bit rate AS_(n), and the like, to thereby determine the target setting bit rate T_(n+1) of the second stream in the (n+1) period. Further, the CPU 11 calculates the quantization step conversion factor α_(n+1) from the target setting bit rate T_(n+1) of the second stream in the (n+1) period and outputs the quantization step conversion factor α_(n+1) to the step value calculation part 14. The step value calculation part 14 performs the computation expressed by Eq. (4) to calculate the quantization step value Q of the second stream and gives the quantization step value Q to the encoder 15.

It can be seen from Eq. (4) that the quantization step conversion factor α_(n) is a ratio between the quantization step values. A relation between the ratio R_(B) between the bit rates and the ratio R_(Q) between the quantization step values can be expressed as Eq. (5) using a function f.

R _(Q) =f(R _(B))   (5)

A specific example of the function f will be discussed. In these preferred embodiments, as shown in Eq. (6), used are functions which can control the ratio between the quantization step values with different characteristics for types I, P and B (I picture, P picture and B picture) of a picture or a macroblock. In Eq. (6), f_(I)(x), f_(P)(x) and f_(B)(x) are functions adapted correspondingly to the I picture, the P picture and the B picture, respectively.

$\begin{matrix} \left\{ \begin{matrix} {{f_{I}(x)} = {{ɛ_{I}*x^{- \delta_{I}}} = {2.1*x^{- 1.1}}}} \\ {{f_{P}(x)} = {{ɛ_{P}*x^{- \delta_{P}}} = {2.1*x^{- 0.8}}}} \\ {{f_{B}(x)} = {{ɛ_{B}*x^{- \delta_{B}}} = {1.3*x^{- 0.9}}}} \end{matrix} \right. & (6) \end{matrix}$

In Eq. (6), as values of ∈_(I), ∈_(P), ∈_(B), δ_(I), δ_(P) and δ_(B) in the case of stream conversion from MPEG2 into H.264, specific numerical values calculated on the basis of an experiment or the like are shown. Specifically, as the result of the experiment, when ∈₁=2.1, ∈_(P)=2.1, ∈_(B)=1.3, δ_(I)=1.1, δ_(P)=0.8 and δ_(B)=0.9, an excellent feature can be obtained. Other than this, by using the feature value of the image which is acquired in decoding the input stream, the values of ∈ and δ may be determined. As expressed by Eq. (7), for example, on the basis of an activity value (act value) and a motion evaluation value (sad value) of the input stream, the values of ∈ and δ may be determined.

$\begin{matrix} \left\{ \begin{matrix} {{f_{I}(x)} = {{ɛ_{I}\left( {a,s} \right)}*x^{- {\delta_{I}{({a,s})}}}}} \\ {{f_{P}(x)} = {{ɛ_{P}\left( {a,s} \right)}*x^{- {\delta_{P}{({a,s})}}}}} \\ {{f_{B}(x)} = {{ɛ_{B}\left( {a,s} \right)}*x^{- {\delta_{B}{({a,s})}}}}} \end{matrix} \right. & (7) \end{matrix}$

In Eq. (7), “a” represents the act value and “s” represents the sad value, and ∈₁(a, s), ∈_(P)(a, s), ∈_(B)(a, s), −δ_(I)(a, s), −δ_(P)(a, s), −δ_(B)(a, s) represent the values of ∈ and δ which are calculated by using the act value and the sad value as parameters.

The activity value is obtained by calculating a sum of absolute differences (SAD) of an average pixel value in a macroblock and a pixel value of each pixel in the macroblock by the macroblock. In other words, the activity value is an evaluation value indicating the degree of dispersion of pixels in the macroblock. This is the same as an activity value used in the code amount control model TM5 of MPEG2 or the like. The motion evaluation value (sad value) is obtained by calculating an interframe sum of absolute differences of a pixel value of each pixel in a reference image macroblock and a pixel value of the corresponding pixel in a macroblock, by the macroblock. In other words, the motion evaluation value is obtained by comparing each pixel in a macroblock with the corresponding pixel in the reference image macroblock and calculating a sum of absolute values of differences of pixel values of corresponding pixels in the same coordinate positions.

More generally, assuming that an feature value of an image in the n period is determined as I_(Ln), f_(I)(x), f_(P)(x) and f_(B)(x) are expressed by Eq. (8).

$\begin{matrix} \left\{ \begin{matrix} {{f_{I}(x)} = {{ɛ_{I}\left( I_{L_{n}} \right)}*x^{- {\delta_{I}{(I_{L_{n}})}}}}} \\ {{f_{P}(x)} = {{ɛ_{P}\left( I_{L_{n}} \right)}*x^{- {\delta_{I}{(I_{L_{n}})}}}}} \\ {{f_{B}(x)} = {{ɛ_{B}\left( I_{L_{n}} \right)}*x^{- {\delta_{I}{(I_{L_{n}})}}}}} \end{matrix} \right. & (8) \end{matrix}$

In Eq. (8), ∈_(I)(I_(Ln)), ∈_(P)(I_(Ln)), ∈_(B)(I_(Ln)), −δ_(I)(I_(Ln)), −δ_(P)(I_(Ln)) and −δ_(B)(I_(Ln)) represent the factors ∈ and δ which are determined by using the feature value I_(Ln) of the image as a parameter.

The First Preferred Embodiment

Next, the first preferred embodiment of the present invention will be discussed. The transcoder 1 of the first preferred embodiment determines the target setting bit rate T_(n+1) of the second stream in the (n+1) period at the point of time when the n period is finished. After the target setting bit rate T_(n+1) in the (n+1) period is determined, the quantization step conversion factor α_(n+1) is calculated in the same manner as discussed with reference to Eq. (5). In other words, the quantization step conversion factor α_(n+1) is calculated by using the function f shown in Eqs. (6) to (8).

Specifically, as expressed by Eq. (9), the quantization step conversion factor α_(n+1) is calculated by substituting the ratio between the target setting bit rate T_(n+1) in the (n+1) period and the average input bit rate S_(n) in the n period into the function f. Then, the computation expressed by Eq. (4) is performed by using the calculated quantization step conversion factor α_(n+1), to thereby calculate the quantization step value Q of the second stream.

α_(n+1) =f(T _(n+1) /S _(n))   (9)

Alternatively, as expressed by Eq. (10), the quantization step conversion factor α_(n+1) may be calculated by using the average period bit rate AS_(n) from the (n−M+1) period to the n period, instead of the average input bit rate S_(n) in the n period.

α_(n+1) =f(T _(n+1) /AS _(n))   (10)

Eq. (11) is an equation for calculation of the target setting bit rate T_(n+1) in the (n+1) period. In other words, Eq. (11) represents the target setting bit rate in the (n+1) period calculated by the transcoder 1 after the n period is finished. Specifically, assuming that a value obtained by dividing the average distortion evaluation value d_(n) of the second stream which is already converted in the n period by the total target distortion evaluation value d is determined as a target distortion ratio, the target setting bit rate T_(n+1) in the (n+1) period can be calculated by multiplying the total target bit rate T of the second stream by the target distortion ratio.

$\begin{matrix} {T_{n + 1} = {T*k*\frac{d_{n}}{d}}} & (11) \end{matrix}$

In Eq. (11), “k” is a positive factor and a factor for adjusting the target distortion ratio. When the average distortion evaluation value d_(n) in the n period exceeds the total target distortion evaluation value d, the image quality is lower than the target quality. In this case, adjustment is made toward the direction in which d_(n)/d will become larger than 1 and the target setting bit rate T_(n+1) will become larger. In the reverse case where the d_(n)/d is smaller than 1, since the image quality is higher than the target quality, the target setting bit rate T_(n+1) is suppressed. The bit rate by which the target setting bit rate T_(n+1) is suppressed may be allocated to another control unit time where the image quality is degraded. When the image quality is higher than the target quality, it is not always necessary to suppress the target setting bit rate T_(n+1). Thus, the ratio of the image distortion to the target distortion is calculated by the d_(n)/d and the target setting bit rate T_(n+1) in the (n+1) period is adjusted by multiplying the total target bit rate T by the target distortion ratio so that the distortion of the image may not become larger.

Though the total target bit rate T of the second stream is multiplied by the d_(n)/d as the target distortion ratio (while the factor k is also used as a multiplier) in Eq. (11), the target distortion ratio may be obtained by substituting the d_(n)/d into a function θ_(n) as expressed by Eq. (12). In this case, the target setting bit rate T_(n+1) can be adjusted by adding the target distortion ratio to the total target bit rate T.

$\begin{matrix} {T_{n + 1} = {T + {\theta_{1}\left( \frac{d_{n}}{d} \right)}}} & (12) \end{matrix}$

Eq. (13) is an equation representing the specific example of the function θ₁.

$\begin{matrix} {{\theta_{1}(x)} = \left\{ \begin{matrix} {x - 1 - b_{0}} & \left( {x \geq {1 + c_{0}}} \right) \\ {a_{0}*\left( {x - 1} \right)^{3}} & \left( {{1 + c_{0}} > x \geq 1} \right) \\ {a_{1}*\left( {x - 1} \right)^{3}} & \left( {1 > x \geq {1 - c_{1}}} \right) \\ {x - 1 + b_{1}} & \left( {{1 - c_{1}} > x} \right) \end{matrix} \right.} & (13) \end{matrix}$

A relation of a₀, a₁, b₀, b₁, c₀, and c₁ in Eq. (13) is expressed by Eq. (14). A characteristic of the function θ₁ represented by such a condition is shown in FIG. 3.

$\begin{matrix} {{{a_{0} > 0},\mspace{14mu} {a_{1} > 0}}{{c_{0} = \frac{1}{\sqrt{3*a_{0}}}},\mspace{14mu} {c_{1} = \frac{1}{\sqrt{3*a_{1}}}}}{{b_{0} = {\frac{2}{3}*c_{0}}},\mspace{14mu} {b_{1} = {\frac{2}{3}*c_{1}}}}} & (14) \end{matrix}$

Thus, the function θ₁ has the characteristic that when an input value is close to “1”, an output does not vary significantly from “0”. From the time when the input value becomes larger than a threshold value (or when the input value becomes smaller than a threshold value), the output value also varies linearly. In the specific examples expressed by Eqs. (13) and (14), the variation of the output value also becomes larger from the time when the input value becomes larger than (1+c₀) or becomes smaller than (1−c₁).

By using the function θ₁, it is possible to avoid significant variation of the target setting bit rate T_(n+1) from the total target bit rate T even when the d_(n)/d deviates a little from “1”. In a case where the target distortion ratio relatively largely deviates, by varying the target setting bit rate it is possible to ensure improvement of the image quality.

In the exemplary case shown in FIG. 3, the function θ₁ has the characteristic of bilateral symmetry (point symmetry with respect to the point (1, 0)). In other words, the rate of change is symmetric with respect to a boundary where the value of the d_(n)/d is 1. The rate of increase in the target distortion ratio in the case where the value of the d_(n)/d is larger than 1 is equal to the rate of decrease in the target distortion ratio in the case where the value of the d_(n)/d is smaller than 1. The rate of change may be asymmetric with respect to a boundary where the value of the d_(n)/d is 1. In this case, the rate of increase in the target distortion ratio in the case where the value of the d_(n)/d is larger than 1 is different from the rate of decrease in the target distortion ratio in the case where the value of the d_(n)/d is smaller than 1. By controlling the values of a₀ and a₁, it is possible to provide a characteristic of bilateral asymmetry. By increasing a_(n) and decreasing a₁, for example, when the distortion evaluation value of the image is larger than a target distortion value, quick improvement of the image quality is performed and when the distortion evaluation value of the image is smaller than the target distortion value, the bit rate is gently suppressed.

The Second Preferred Embodiment

Next, the second preferred embodiment of the present invention will be discussed. Also in the second preferred embodiment, like in the first preferred embodiment, the transcoder 1 determines the target setting bit rate T_(n+1) of the second stream in the (n+1) period at the point of time when the n period is finished.

After the target setting bit rate T_(n+1) in the (n+1) period is determined, the quantization step conversion factor α_(n+1) is calculated in the same manner as discussed with reference to Eq. (5). In other words, the quantization step conversion factor α_(n+1) is calculated by performing the computation expressed by Eq. (9) or Eq. (10) using the function f shown in Eqs. (6) to (8). After the quantization step conversion factor α_(n+1) is calculated, the quantization step value Q of the second stream is calculated by using Eq. (4).

Eq. (15) is an equation for calculation of the target setting bit rate T_(n+1) in the (n+1) period. In this case, a value obtained by dividing the average output bit rate C_(n) of the second stream which is already converted in the n period by the target setting bit rate T_(n) in the n period is determined as a target rate ratio. The target setting bit rate T_(n+1) in the (n+1) period can be calculated by multiplying a quotient of the total target bit rate T of the second stream divided by the target rate ratio by the target distortion ratio. The target distortion ratio is a value obtained by dividing the average distortion evaluation value d_(n) by the total target distortion evaluation value d like in the first preferred embodiment.

$\begin{matrix} {T_{n + 1} = {k*\frac{T}{\frac{C_{n}}{T_{n}}}*\frac{d_{n}}{d}}} & (15) \end{matrix}$

In Eq. (15), “k” is a positive factor and a factor for adjusting the target rate ratio and the target distortion ratio. A ratio of the bit rate to the target bit rate in the n period is calculated by C_(n)/T_(n) and the target setting bit rate T_(n+1) in the (n+1) period is adjusted by dividing the total target bit rate T by the target rate ratio. When the average output bit rate C_(n) exceeds the target setting bit rate T, the C_(n)/T_(n) becomes larger than 1 and then the target setting bit rate T_(n+1) is suppressed and controlled so that the bit rate should be close to the target bit rate on the whole.

Further, the total target bit rate T is multiplied by the target distortion ratio (d_(n)/d) and when the image quality is degraded (d_(n)/d is larger than 1), adjustment is made toward the direction to increase the target setting bit rate T_(n+1). It is thereby possible to improve the image quality while maintaining the target bit rate. In the reverse case where the d_(n)/d is smaller than 1, since the image quality is higher than the target quality, the target setting bit rate T_(n+1) is suppressed. The bit rate by which the target setting bit rate T_(n+1) is suppressed may be allocated to another control unit time where the image quality is degraded. When the image quality is higher than the target quality, it is not always necessary to suppress the target setting bit rate T_(n+1).

Though the C_(n)/T_(n) is used as the target rate ratio (while the factor k is also used as a multiplier) in Eq. (15), the target rate ratio may be obtained by substituting the C_(n)/T_(n) into a function θ₂ as expressed by Eq. (16).

$\begin{matrix} {T_{n + 1} = {k*\frac{T}{\theta_{2}\left( \frac{C_{n}}{T_{n}} \right)}*\frac{d_{n}}{d}}} & (16) \end{matrix}$

Eq. (17) is an equation representing the specific example of the function θ₂.

$\begin{matrix} {{\theta_{2}(x)} = \left\{ \begin{matrix} {x - b_{0}} & \left( {x \geq {1 + c_{0}}} \right) \\ {{a_{0}*\left( {x - 1} \right)^{3}} + 1} & \left( {{1 + c_{0}} > x \geq 1} \right) \\ {{a_{1}*\left( {x - 1} \right)^{3}} + 1} & \left( {1 > x \geq {1 - c_{1}}} \right) \\ {x + b_{1}} & \left( {{1 - c_{1}} > x} \right) \end{matrix} \right.} & (17) \end{matrix}$

A relation of a₀, a₁, b₀, b₁, c₀, and c₁ in Eq. (17) is expressed by Eq. (18). A characteristic of the function θ₂ represented by such a condition is shown in FIG. 4.

$\begin{matrix} {{{a_{0} > 0},\mspace{14mu} {a_{1} > 0}}{{c_{0} = \frac{1}{\sqrt{3*a_{0}}}},\mspace{14mu} {c_{1} = \frac{1}{\sqrt{3*a_{1}}}}}{{b_{0} = {\frac{2}{3}*c_{0}}},\mspace{14mu} {b_{1} = {\frac{2}{3}*c_{1}}}}} & (18) \end{matrix}$

Thus, the function θ₂ has the characteristic that when the input value is close to “1”, an output does not vary significantly from “1”. From the time when the input value becomes larger than a threshold value (or when the input value becomes smaller than a threshold value), the output value also varies linearly. In the specific examples expressed by Eqs. (17) and (18), the variation of the output value also becomes larger from the time when the input value becomes larger than (1+c₀) or becomes smaller than (1−c₁).

By using the function θ₂, it is possible to avoid significant variation of the target setting bit rate T_(n+1) from the total target bit rate T even when the C_(n)/T_(n) deviates a little from “1”. In a case where the target rate ratio relatively largely deviates, by varying the target setting bit rate T_(n+1), it is possible to optimally follow the target.

In the exemplary case shown in FIG. 4, the function θ₂ has the characteristic of bilateral symmetry (point symmetry with respect to the point (1, 1)). In other words, the rate of change is symmetric with respect to a boundary where the value of the C_(n)/T_(n) is 1. The rate of increase in the target rate ratio in the case where the value of the C_(n)/T_(n) is larger than 1 is equal to the rate of decrease in the target rate ratio in the case where the value of the C_(n)/T_(n) is smaller than 1. The rate of change may be asymmetric with respect to a boundary where the value of the C_(n)/T_(n) is 1. In this case, the rate of increase in the target rate ratio in the case where the value of the C_(n)/T_(n) is larger than 1 is different from the rate of decrease in the target rate ratio in the case where the value of the C_(n)/T_(n) is smaller than 1. By controlling the values of a₀ and a₁, it is possible to provide a characteristic of bilateral asymmetry. By increasing a₀ and decreasing a₁, for example, when the bit rate becomes larger than the target value, the bit rate can be controlled to quickly decrease to the target value and when the bit rate becomes smaller than the target value, the bit rate can be controlled to gently increase to the target value.

Though the value obtained by dividing the average output bit rate C_(n) of the second stream in the n period by the target setting bit rate T_(n) is used as the target rate ratio in Eq. (15), the target rate ratio is further adjusted in Eq. (19).

$\begin{matrix} {T_{n + 1} = {k*\frac{T}{\frac{S_{n - 1}}{S_{n}}*\frac{C_{n}}{T_{n}}}*\frac{_{n}}{}}} & (19) \end{matrix}$

In Eq. (19), the target rate ratio is multiplied by S_(n−1)/S_(n). The S_(n−1)/S_(n) is a value obtained by dividing the average input bit rate S_(n−1) of the first stream in the (n−1) period by the average input bit rate S_(n) of the first stream in the n period and may be referred to as a period ratio of the average input bit rates. In other words, the target ratio is adjusted by multiplying the target rate ratio by the period ratio.

By multiplying the target ratio by the period ratio, it is possible to correct local variation of the target ratio. When the average input bit rate S_(n) locally decrease, for example, the target ratio C_(n)/T_(n) also accordingly decrease in some cases. Also in such a case, the target ratio is adjusted by multiplying the target ratio by the period ratio S_(n−1)/S_(n) (the period ratio takes a value larger than 1 in this case) and it is possible to avoid significant variation of the target setting bit rate T_(n+1). In the reverse case where the average input bit rate S_(n) locally increase, the period ratio S_(n+1)/S_(n) takes a value smaller than 1 and it is possible to suppress a quick increase of the target ratio.

Also in the method of calculating the target setting bit rate T_(n+1) by using the function θ₂ expressed by Eq. (16), similarly, the target rate ratio may be set by multiplying the target rate ratio by the period ratio. The target setting bit rate T_(n+1) calculated by this method is shown in Eq. (20). Also in this case, the target rate ratio is adjusted with the variation of the average input bit rate and it is possible to avoid significant variation of the target setting bit rate T_(n+1).

$\begin{matrix} {T_{n + 1} = {k*\frac{T}{\frac{S_{n - 1}}{S_{n}}*{\theta_{2}\left( \frac{C_{n}}{T_{n}} \right)}}*\frac{_{n}}{}}} & (20) \end{matrix}$

In Eq. (19), the value obtained by dividing the average input bit rate S_(n−1) of the first stream in the (n−1) period by the average input bit rate S_(n) of the first stream in the n period is used as the period ratio. On the other hand, in a case where the average input bit rate S_(n+1) of the first stream in the (n+1) period can be obtained in advance, the average input bit rate S_(n+1) may be used. The case where the average input bit rate S_(n+1) can be obtained in advance refers to a case where there is spare time to perform buffering of information on the average input bit rate S_(n+1) and then calculate the target setting bit rate T_(n+1) in the (n+1) period. In other words, it refers to a case where a processing delay to some degree is allowed.

In the case where the average input bit rate S_(n+1) can be obtained in advance, in Eq. (19), the target rate ratio may be multiplied by the period ratio S_(n)/S_(n+1), instead of the period ratio S_(n−1)/S_(n). By using the latest input information, it is possible to ensure optimal control of the target bit rate.

In Eq. (20), the target rate ratio obtained by using the function θ₂ is multiplied by the period ratio. Also in this case, similarly, the target rate ratio may be multiplied by the period ratio S_(n)/S_(n+1), instead of the period ratio S_(n−1)/S_(n). It is thereby possible to control the target setting bit rate T_(n+1) with higher accuracy.

In the case where the average input bit rate S_(n+1) of the first stream in the (n+1) period is used as the period ratio, instead of Eq. (9), as expressed by Eq. (21), by substituting a ratio between the target setting bit rate T_(n+1), in the (n+1) period and the average input bit rate S_(n+1) in the (n+1) period into the function f, the quantization step conversion factor α_(n+1) in the (n+1) period can be calculated.

α_(n+1) =f(T _(n+1) /S _(n+1))   (21)

In Eq. (19) and Eq. (20), the S_(n−1)/S_(n) is used as the period ratio. Instead, a ratio between the average period bit rate AS_(n−1) for past M periods including the (n−1) period and the average period bit rate AS_(n) for past M periods including the n period may be used as the period ratio. In other words, in Eqs. (19) and (20), AS_(n−1)/AS_(n) may be used as the period ratio, instead of the S_(n−1)/S_(n). It is thereby possible to ensure optimal control of the target setting bit rate T_(n+1) by reducing the effect of the local variation.

Alternatively, in the case where the average input bit rate S_(n+1) of the first stream in the (n+1) period can be obtained in advance, a ratio between the average period bit rate AS_(n) for the past M periods including the n period and the average period bit rate AS_(n+1) for past M periods including the (n+1) period may be used as the period ratio. In other words, in Eqs. (19) and (20), AS_(n)/AS_(n+1) may be used as the period ratio, instead of the S_(n−1)/S_(n). It is thereby possible to control the target setting bit rate T_(n+1) with higher accuracy by reducing the effect of the local variation and using the latest input bit rate.

In the case where the average period bit rate AS_(n+1) for the past M periods including the (n+1) period is used as the period ratio, instead of Eq. (10), as expressed by Eq. (22), by substituting a ratio between the target setting bit rate T_(n+1) in the (n+1) period and the average period bit rate AS_(n+1) for the past M periods including the (n+1) period into the function f, the quantization step conversion factor α₊₁ in the (n+1) period can be calculated.

α_(n+1) =f(T _(n+1) /AS _(n+1))   (22)

Though the method of calculating the target setting bit rate T_(n+1) in the second preferred embodiment has been discussed with reference to Eqs. (15), (16), (19), (20), and the like, in all the cases of the second preferred embodiment including the cases discussed with reference to these equations, the average period bit rate AC_(n) from the (n−M+1) period to the n period may be used, instead of the average output bit rate C_(n) in the n period. In Eqs. (15), (16), (19), (20), and the like, for example, AC_(n) is used instead of C_(n). It is thereby possible to ensure optimal control of the target setting bit rate T_(n+1) by gently evaluating the variation of C_(n).

The Third Preferred Embodiment

Next, the third preferred embodiment of the present invention will be discussed. Also in the third preferred embodiment, like in the first and second preferred embodiments, the transcoder 1 determines the target setting bit rate T_(n+1) of the second stream in the (n+1) period at the point of time when the n period is finished.

After the target setting bit rate T_(n+1) in the (n+1) period is determined, the quantization step conversion factor α_(n+1) is calculated in the same manner as discussed with reference to Eq. (5). In other words, the quantization step conversion factor α_(n+1) is calculated by performing the computation expressed by Eq. (9) or Eq. (10) using the function f shown in Eqs. (6) to (8). After the quantization step conversion factor α_(n+1) is calculated, the quantization step value Q of the second stream is calculated by using Eq. (4).

Eq. (23) is an equation for calculation of the target setting bit rate T_(n+1) in the (n+1) period. In this case, a value obtained by subtracting the average output bit rate C_(n) of the second stream which is already converted in the n period from the target setting bit rate T_(n) in the n period is determined as a target rate difference. The target rate difference is adjusted by multiplying the target rate difference by the target distortion ratio (d_(n)/d). The target setting bit rate T_(n+1) in the (n+1) period can be calculated by adding the target rate difference adjusted by the target distortion ratio to the total target bit rate T of the second stream.

$\begin{matrix} {T_{n + 1} = {T + {k*\left( {T_{n} - C_{n}} \right)*\frac{_{n}}{}}}} & (23) \end{matrix}$

In Eq. (23), “k” is a positive factor and a factor for adjusting the target rate difference and the target distortion ratio. A difference between the bit rate and the target bit rate in the n period is calculated by (T_(n)−C_(n)) and the target setting bit rate T_(n+1) in the (n+1) period is adjusted by adding the target rate difference to the total target bit rate T. When the average output bit rate C_(n) exceeds the target setting bit rate T_(n), the (T_(n)−C_(n)) becomes a negative value and the target setting bit rate T_(n+1) is suppressed and controlled so that the bit rate should be close to the target bit rate on the whole. In the reverse case where the average output bit rate C_(n) is smaller than the target setting bit rate T_(n), the (T_(n)−C_(n)) becomes a positive value and the target setting bit rate T_(n+1) is increased and controlled so that the bit rate should be close to the target bit rate on the whole.

Further, the target rate difference is multiplied by the target distortion ratio (d_(n)/d). When the average output bit rate C_(n) is smaller than the target setting bit rate T_(n) and the image quality is lower than the target quality, the d_(n)/d becomes larger than 1 and adjustment is made toward the direction to increase an increase range of the target setting bit rate T_(n+1). It is thereby possible to quickly improve the image quality. When the average output bit rate C_(n) is larger than the target setting bit rate T_(n) and the image quality is higher than the target quality, the d_(n)/d becomes smaller than 1 and this decreases a suppression range of the target setting bit rate T_(n+1) and gently suppresses the bit rate.

Though the (T_(n)−C_(n)) is used as the target rate difference (while the factor k is also used as a multiplier) in Eq. (23), the target rate difference may be obtained by substituting the (T_(n)−C_(n)) into a function θ₃ as expressed by Eq. (24).

$\begin{matrix} {T_{n + 1} = {T + {{\theta_{3}\left( {T_{n} - C_{n}} \right)}*\frac{_{n}}{}}}} & (24) \end{matrix}$

Eq. (25) is an equation representing the specific example of the function θ₃.

$\begin{matrix} {{\theta_{3}(x)} = \left\{ \begin{matrix} {x - b_{0}} & \left( {x \geq c_{0}} \right) \\ {a_{0}*x^{3}} & \left( {c_{0} > x \geq 0} \right) \\ {a_{1}*x^{3}} & \left( {0 > x \geq c_{1}} \right) \\ {x + b_{1}} & \left( {{- c_{1}} > x} \right) \end{matrix} \right.} & (25) \end{matrix}$

A relation of a₀, a₁, b₀, b₁, c₀, and c₁ in Eq. (25) is expressed by Eq. (26). A characteristic of the function θ₃ represented by such a condition is shown in FIG. 5.

$\begin{matrix} {{{a_{0} > 0},{a_{1} > 0}}{{c_{0} = \frac{1}{\sqrt{3*a_{0}}}},{c_{1} = \frac{1}{\sqrt{3*a_{1}}}}}{{b_{0} = {\frac{2}{3}*c_{0}}},{b_{1} = {\frac{2}{3}*c_{1}}}}} & (26) \end{matrix}$

Thus, the function θ₃ has the characteristic that when the input value is close to “0”, an output does not vary significantly from “0”. In the specific examples expressed by Eqs. (25) and (26), the output value also significantly varies from the time when the input value becomes larger than c₀ or becomes smaller than −c₁.

By using the function θ₃, it is possible to avoid significant variation of the target setting bit rate T_(n+1) from the total target bit rate T even when the (T_(n)−C_(n)) deviates a little from “0”. In a case where the target difference relatively largely deviates, by varying the target setting bit rate T_(n+1), it is possible to optimally follow the target.

In the exemplary case shown in FIG. 5, the function θ₃ has the characteristic of bilateral symmetry (point symmetry with respect to the point (0, 0)). In other words, the rate of change is symmetric with respect to a boundary where the value of the (T_(n)−C_(n)) is 0. By increasing a₁ and decreasing a₀, for example, when the bit rate becomes larger than the target value, the bit rate can be controlled to quickly decrease to the target value and when the bit rate becomes smaller than the target value, the bit rate can be controlled to gently increase to the target value.

Though the target rate difference obtained by subtracting the average output bit rate C_(n) of the second stream in the n period from the target setting bit rate T_(n) in the n period is used in Eq. (23), the target rate difference is further adjusted in Eq. (27).

$\begin{matrix} {T_{n + 1} = {T + {k*\frac{S_{n}}{S_{n - 1}}*\left( {T_{n} - C_{n}} \right)*\frac{_{n}}{}}}} & (27) \end{matrix}$

In Eq. (27), the target rate difference is multiplied by S_(n)/S_(n−1). The S_(n)/S_(n−1) is a period ratio obtained by dividing the average input bit rate S_(n) of the first stream in the n period by the average input bit rate S_(n−1) of the first stream in the (n−1) period. In other words, the target difference is adjusted by multiplying the target rate difference by the period ratio.

By multiplying the target rate difference by the period ratio, it is possible to correct local variation of the target difference. When the average input bit rate S_(n) in the n period varies locally less than that in the (n−1) period, for example, the target rate difference (T_(n)−C_(n)) accordingly increases quickly in some cases. Also in such a case, the target rate difference is adjusted by multiplying the target rate difference by the period ratio S_(n)/S_(n−1) (the period ratio takes a value smaller than 1 in this case) and it is possible to avoid a quick increase of the target setting bit rate T_(n+1).

Further, the target rate difference is multiplied by the target distortion ratio (d_(n)/d). When the average output bit rate C_(n) is smaller than the target setting bit rate T_(n) and the image quality is lower than the target quality, the d_(n)/d becomes larger than 1 and adjustment is made toward the direction to increase the increase range of the target setting bit rate T_(n+1). It is thereby possible to quickly improve the image quality. When the average output bit rate C_(n) is larger than the target setting bit rate T_(n) and the image quality is higher than the target quality, the d_(n)/d becomes smaller than 1 and this decreases the suppression range of the target setting bit rate T_(n+1) and gently suppresses the bit rate.

Also in the method of calculating the target setting bit rate T_(n+1) by using the function θ₃ expressed by Eq. (24), similarly, the target rate difference may be set by multiplying the target rate difference by the period ratio. The target setting bit rate T_(n+1) calculated by this method is shown in Eq. (28). Also in this case, the target rate difference is adjusted with the variation of the average input bit rate and it is possible to avoid significant variation of the target setting bit rate T_(n+1). Further, the target rate difference is multiplied by the target distortion ratio (d_(n)/d) and it is thereby possible to improve the image quality while maintaining the target bit rate.

$\begin{matrix} {T_{n + 1} = {T + {\frac{S_{n}}{S_{n - 1}}*{\theta_{3}\left( {T_{n} - C_{n}} \right)}*\frac{_{n}}{}}}} & (28) \end{matrix}$

In Eq. (27), the value obtained by dividing the average input bit rate S_(n) of the first stream in the n period by the average input bit rate S_(n−1) of the first stream in the (n−1) period is used as the period ratio. On the other hand, in the case where the average input bit rate S_(n+1) of the first stream in the (n+1) period can be obtained in advance, the average input bit rate S_(n+1) may be used. As discussed above, the case where the average input bit rate S_(n+1) can be obtained in advance refers to a case where there is spare time to perform buffering of information on the average input bit rate S_(n+1) and then calculate the target setting bit rate T_(n+1) in the (n+1) period. In this case, in Eq. (27), S_(n+1)/S_(n) is used, instead of the S_(n)/S_(n−1). Thus, by using the average input bit rate S_(n+1) of the first stream in the (n+1) period, it is possible to control the target setting bit rate T_(n+1) with higher accuracy.

In Eq. (28), the target rate difference obtained by using the function θ₃ is multiplied by the period ratio. Also in this case, similarly, the period ratio S_(n+1)/S_(n) may be used, instead of the period ratio S_(n)/S_(n−1). Also in this case, it is thereby possible to control the target setting bit rate T_(n+1) with higher accuracy.

In the case where the average input bit rate S_(n+1) of the first stream in the (n+1) period is used as the period ratio, instead of Eq. (9), as expressed by Eq. (21), by substituting the ratio between the target setting bit rate T_(n+1) in the (n+1) period and the average input bit rate S_(n+1) in the (n+1) period into the function f, the quantization step conversion factor α_(n+1) in the (n+1) period can be calculated.

In Eq. (27) and Eq. (28), the S_(n)/S_(n), is used as the period ratio. Instead, a ratio between the average period bit rate AS_(n) for the past M periods including the n period and the average period bit rate AS_(n−1) for the past M periods including the (n−1) period may be used as the period ratio. In other words, in Eqs. (27) and (28), AS_(n)/AS_(n−1) may be used as the period ratio, instead of the S_(n)/S_(n−1). It is thereby possible to ensure optimal control of the target setting bit rate T_(n+1) by reducing the effect of the local variation.

Alternatively, in the case where the average input bit rate S_(n+1) of the first stream in the (n+1) period can be obtained in advance, the ratio between the average period bit rate AS_(n+1) for the past M periods including the (n+1) period and the average period bit rate AS_(n) for the past M periods including the n period may be used as the period ratio. In other words, in Eqs. (27) and (28), AS_(n+1)/AS_(n) may be used as the period ratio, instead of the S_(n)/S_(n−1). It is thereby possible to control the target setting bit rate T_(n+1) with higher accuracy by reducing the effect of the local variation and using the latest input bit rate.

In the case where the average period bit rate AS_(n+1) for the past M periods including the (n+1) period is used as the period ratio, instead of Eq. (10), as expressed by Eq. (22), by substituting the ratio between the target setting bit rate T_(n+1) in the (n+1) period and the average period bit rate AS_(n+1) for the past M periods including the (n+1) period into the function f, the quantization step conversion factor α_(n+1) in the (n+1) period can be calculated.

Though the method of calculating the target setting bit rate T_(n+1) in the third preferred embodiment has been discussed with reference to Eqs. (23), (24), (27), (28), and the like, in all the cases of the third preferred embodiment including the cases discussed with reference to these equations, the average period bit rate AC_(n) from the (n−M+1) period to the n period may be used, instead of the average output bit rate C_(n) in the n period. In Eqs. (23), (24), (27), (28), and the like, for example, AC_(n) may be used instead of C_(n). It is thereby possible to ensure optimal control of the target setting bit rate T_(n+1) by gently evaluating the variation of C_(n).

The Fourth Preferred Embodiment

Next, the fourth preferred embodiment of the present invention will be discussed. Also in the fourth preferred embodiment, like in the first to third preferred embodiments, the transcoder 1 determines the target setting bit rate T_(n+1) of the second stream in the (n+1) period at the point of time when the n period is finished. The fourth preferred embodiment is an application example of the first to third preferred embodiments.

Though the value obtained by dividing the average distortion evaluation value d_(n) by the total target distortion evaluation value d is used as the target distortion ratio in Eq. (11), the target distortion ratio may be obtained by substituting the d_(n)/d into the function θ₄ as expressed by Eq. (29),

$\begin{matrix} {T_{n + 1} = {T*k*{\theta_{4}\left( \frac{_{n}}{} \right)}}} & (29) \end{matrix}$

Eq. (30) is an equation representing the specific example of the function θ₄.

$\begin{matrix} {{\theta_{4}(x)} = \left\{ \begin{matrix} {x - b_{0}} & \left( {x \geq {1 + c_{0}}} \right) \\ {{a_{0}*\left( {x - 1} \right)^{3}} + 1} & \left( {{1 + c_{0}} > x \geq 1} \right) \\ {{a_{1}*\left( {x - 1} \right)^{3}} + 1} & \left( {1 > x \geq {1 - c_{1}}} \right) \\ {x + b_{1}} & \left( {{1 - c_{1}} > x} \right) \end{matrix} \right.} & (30) \end{matrix}$

A relation of a₀, a₁, b₀, b₁, c₀, and c₁ in Eq. (30) is expressed by Eq. (31). A characteristic of the function θ₄ represented by such a condition is shown in FIG. 6.

$\begin{matrix} {{{a_{0} > 0},{a_{1} > 0}}{{c_{0} = \frac{1}{\sqrt{3*a_{0}}}},{c_{1} = \frac{1}{\sqrt{3*a_{1}}}}}{{b_{0} = {\frac{2}{3}*c_{0}}},{b_{1} = {\frac{2}{3}*c_{1}}}}} & (31) \end{matrix}$

Thus, the function θ₄ has the characteristic that when an input value is close to “1”, an output does not vary significantly from “1”. In the specific examples expressed by Eqs. (30) and (31), the output value also significantly varies from the time when the input value becomes larger than (1+c₀) or becomes smaller than (1−c₁).

By using the function θ₄, it is possible to avoid significant variation of the target setting bit rate T_(n+1) from the total target bit rate T even when the target distortion ratio (d_(n)/d) deviates a little from “1”. In the case where the target distortion ratio relatively largely deviates, by varying the target setting bit rate T_(n+1), it is possible to optimally follow the target.

In the exemplary case shown in FIG. 6, the function θ₄ has the characteristic of bilateral symmetry (point symmetry with respect to the point (1, 1)). The rate of change may be asymmetric with respect to a boundary where the value of the (d_(n)/d) is 1. By increasing a₀ and decreasing a₁, for example, when the distortion evaluation value becomes larger than the target distortion value (in other words, the image quality is degraded), quick improvement of the image quality can be performed and when the distortion evaluation value becomes smaller than the target distortion value, the bit rate can be controlled to gently increase to the target value.

Though the preferred embodiment in which d_(n)/d is replaced by θ₄(d_(n)/d) in Eq. (11) has been discussed with reference to Eq. (29), similarly, d_(n)/d may be replaced by θ₄(d_(n)/d) in Eqs. (15), (16), (19), and (20) in the second preferred embodiment.

When the distortion evaluation value of the image is larger than the target distortion value (the image quality is degraded as compared with the target quality), adjustment can be made toward the direction to increase the target setting bit rate T_(n+1) at a desired speed by using the function θ₄. It is thereby possible to improve the image quality while maintaining the target bit rate on the whole.

Also in Eqs. (23), (24), (27), and (28) in the third preferred embodiment, d_(n)/d may be replaced by θ₄(d_(n)/d). In this case, however, unlike in the case where the function θ₄ is applied to Eqs. (11), (15), and the like, it is necessary to change the characteristic of the function θ₄ a little by adjusting a₀, a₁, b₀, b₁, c₀, and c₁. This is because θ₄ used as a parameter by which the total target bit rate T is multiplied in the case where the function θ₄ is applied to Eqs. (11), (15), and the like but θ₄(d_(n)/d) is used as a parameter by which the target rate difference is multiplied in the case where the function θ₄ is applied to Eq. (23) and the like in the third preferred embodiment.

The Fifth Preferred Embodiment

Next, the fifth preferred embodiment of the present invention will be discussed. Also in the fifth preferred embodiment, like in the first to fourth preferred embodiments, the transcoder 1 determines the target setting bit rate T_(n+1) of the second stream in the (n+1) period at the point of time when the n period is finished.

Though the target distortion ratio obtained by dividing the average distortion evaluation value d_(n) by the total target distortion evaluation value d is used as a parameter in the first to fourth preferred embodiments, a target distortion difference obtained by subtracting the total target distortion evaluation value d from the average distortion evaluation value d_(n) is used as a parameter in the fifth preferred embodiment.

Eq. (32) is an equation for calculation of the target setting bit rate T_(n+1) in the (n+1) period. The target setting bit rate T_(n+1) , is calculated by adding the target distortion difference (d_(n)−d) to the total target bit rate T.

T _(n+1) =T+k*(d _(n) −d)   (32)

In Eq. (32), “k” is a positive factor and a factor for adjusting the target distortion difference. When the image quality is lower than the target quality and the (d_(n)−d) becomes positive, adjustment is made toward the direction to increase the target setting bit rate T_(n+1) and the image quality is improved. When the image quality is higher than the target quality and the (d_(n)−d) becomes negative, the target setting bit rate T_(n+1) is suppressed. The bit rate by which the target setting bit rate T_(n+1) is suppressed may be allocated later to another control unit time where improvement of the image quality will be needed. When the image quality is higher than the target quality, it is not always necessary to suppress the target setting bit rate T_(n+1).

Though the (d_(n)−d) is added as the target distortion difference to the total target bit rate T (while the factor k is also used as a multiplier) in Eq. (32), the target distortion difference may be obtained by substituting the (d_(n)−d) into a function θ₅ as expressed by Eq. (33). In this case, the target setting bit rate T_(n+1) can be adjusted by multiplying the total target bit rate T by the target distortion difference.

T _(n+1) =T*θ ₅(d _(n) −d)   (33)

Eq. (34) is an equation representing the specific example of the function θ₅.

$\begin{matrix} {{\theta_{5}(x)} = \left\{ \begin{matrix} {x - b_{0} + 1} & \left( {x \geq c_{0}} \right) \\ {{a_{0}*x^{3}} + 1} & \left( {c_{0} > x \geq 0} \right) \\ {{a_{1}*x^{3}} + 1} & \left( {0 > x \geq {- c_{1}}} \right) \\ {x + b_{1} + 1} & \left( {{- c_{1}} > x} \right) \end{matrix} \right.} & (34) \end{matrix}$

A relation of a₀, a₁, b₀, b₁, c₀, and c₁ in Eq. (34) is expressed by Eq. (35). A characteristic of the function θ₅ represented by such a condition is shown in FIG. 7.

$\begin{matrix} {{{a_{0} > 0},{a_{1} > 0}}{{c_{0} = \frac{1}{\sqrt{3*a_{0}}}},{c_{1} = \frac{1}{\sqrt{3*a_{1}}}}}{{b_{0} = {\frac{2}{3}*c_{0}}},{b_{1} = {\frac{2}{3}*c_{1}}}}} & (35) \end{matrix}$

Thus, the function θ₅ has the characteristic that when an input value is close to “0”, an output does not vary significantly from “1”. In the specific examples expressed by Eqs. (34) and (35), the output value also significantly varies from the time when the input value becomes larger than c₀ or becomes smaller than −c₁.

By using the function θ₅, it is possible to avoid significant variation of the target setting bit rate T_(n+1) from the total target bit rate T even when the (d_(n)−d) deviates a little from “0”. In a case where the target distortion difference relatively largely deviates, by varying the target setting bit rate T_(n+1), it is possible to improve the image quality in accordance with the degree of the distortion.

In the exemplary case shown in FIG. 7, the function θ₅ has the characteristic of bilateral symmetry (point symmetry with respect to the point (0, 1)). The rate of change may be asymmetric with respect to a boundary where the value of the (d_(n)−d) is 0. By increasing a₀ and decreasing a₁, for example, when the average distortion evaluation value d_(n) becomes larger than the total target distortion evaluation value d, since the image quality is degraded as compared with the target quality, quick improvement of the image quality is performed. In the reverse case where the average distortion evaluation value d_(n) becomes smaller than the total target distortion evaluation value d, since the image quality is higher than the target quality, the target setting bit rate T_(n+1) is gently suppressed.

Though the value of the (d_(n)−d) is used as the target distortion difference in Eq. (32), the (d_(n)−d) may be substituted into a function θ₆ as expressed by Eq. (36).

T _(n+1) =T+θ ₆(d _(n) −d)   (36)

Eq. (37) is an equation representing the specific example of the function θ₆.

$\begin{matrix} {{\theta_{6}(x)} = \left\{ \begin{matrix} {x - b_{0}} & \left( {x \geq c_{0}} \right) \\ {a_{0}*x^{3}} & \left( {c_{0} > x \geq 0} \right) \\ {a_{1}*x^{3}} & \left( {0 > x \geq {- c_{1}}} \right) \\ {x + b_{1}} & \left( {{- c_{1}} > x} \right) \end{matrix} \right.} & (37) \end{matrix}$

A relation of a₀, a₁, b₀, b₁, c₀, and c₁ in Eq. (37) is expressed by Eq. (38). A characteristic of the function θ₆ represented by such a condition is shown in FIG. 8.

$\begin{matrix} {{{a_{0} > 0},{a_{1} > 0}}{{c_{0} = \frac{1}{\sqrt{3*a_{0}}}},{c_{1} = \frac{1}{\sqrt{3*a_{1}}}}}{{b_{0} = {\frac{2}{3}*c_{0}}},{b_{1} = {\frac{2}{3}*c_{1}}}}} & (38) \end{matrix}$

Thus, the function θ₆ has the characteristic that when an input value is close to “0”, an output does not vary significantly from “0”. In the specific examples expressed by Eqs. (37) and (38), the output value also significantly varies from the time when the input value becomes larger than c₀ or becomes smaller than −c₁.

By using the function θ₆, it is possible to avoid significant variation of the target setting bit rate T_(n+1) from the total target bit rate T even when the (d_(n)−d) deviates a little from “0”. In a case where the target distortion difference relatively largely deviates, by varying the target setting bit rate T_(n+1), it is possible to improve the image quality in accordance with the degree of the distortion.

In the exemplary case shown in FIG. 8, the function θ₆ has the characteristic of bilateral symmetry (point symmetry with respect to the point (0, 0)). The rate of change may be asymmetric with respect to a boundary where the value of the (d_(n)−d) is 0. By increasing a₀ and decreasing a₁, for example, when the average distortion evaluation value d_(n) becomes larger than the total target distortion evaluation value d, since the image quality is degraded as compared with the target quality, quick improvement of the image quality is performed. In the reverse case where the average distortion evaluation value d, becomes smaller than the total target distortion evaluation value d, since the image quality is higher than the target quality, the target setting bit rate T_(n+1) is gently suppressed.

The Sixth Preferred Embodiment

In the first to fifth preferred embodiments, the average distortion evaluation value d_(n) is used as a parameter in order to calculate the target setting bit rate T_(n+1). Herein, in the first to fifth preferred embodiments, the average period distortion evaluation value Ad_(n) may be used, instead of the average distortion evaluation value d_(n). In Eqs. (11), (12), (15), (16), (19), (20), (23), (24), (27), (28), (29), (32), (33), (36), and the like, the average distortion evaluation value d_(n) has only to be replaced by the average period distortion evaluation value Ad_(n).

It is thereby possible to optimally control the target setting bit rate T_(n+1) while gently evaluating the variation of the average distortion evaluation value d_(n) and improving the image quality, without being affected by local distortion of the image quality.

The Seventh Preferred Embodiment

In the transcoder 1 shown in FIG. 1, the second decoder 17 for decoding the second stream outputted from the encoder 15 is separately provided. In the seventh preferred embodiment, as shown in FIG. 9, data outputted from a local decoder 151 in the encoder 15 is outputted to the distortion calculation part 18.

The local decoder 151 is a processing block used to decode again the image data encoded in the encoder 15, to thereby perform interframe prediction. By using the local decoder 151 both for encoding and for calculation of the distortion evaluation value, there is no necessity of separately providing a decoder for calculating the distortion evaluation value and this allows compact device configuration.

While the present invention has been described with reference to the particular illustrative embodiments, it is not to be restricted by the embodiments but only by the appended claims. It is to be appreciated that those skilled in the art can change or modify the embodiments without departing from the scope and spirit of the present invention. 

1. A transcoder for converting a first stream into a second stream, comprising: a part for setting a target bit rate (T) of a whole second stream; and a target bit rate determination part for determining a target bit rate (T_(n+1)) of a second stream in the (N+1) period on the basis of said target bit rate (T) of said whole second stream, a distortion evaluation value of a second stream converted in or before the N period, and a target distortion evaluation value (d) of said whole second stream.
 2. The transcoder according to claim 1, wherein assuming that a value calculated out on the basis of a ratio between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is determined as a target distortion ratio, said target bit rate determination part calculates said target bit rate (T_(n+1)) of said second stream in the (N+1) period by multiplying said target bit rate (T) of said whole second stream by said target distortion ratio.
 3. The transcoder according to claim 1, wherein assuming that a value calculated out on the basis of a ratio between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is determined as a target distortion ratio, said target bit rate determination part calculates said target bit rate (T_(n+1)) of said second stream in the (N+1) period by adding said target distortion ratio to said target bit rate (T) of said whole second stream.
 4. The transcoder according to claim 1, wherein assuming that a value calculated out on the basis of a difference between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is determined as a target distortion difference, said target bit rate determination part calculates said target bit rate (T_(n+1)) of said second stream in the (N+1) period by adding said target distortion difference to said target bit rate (T) of said whole second stream.
 5. The transcoder according to claim 1, wherein assuming that a value calculated out on the basis of a difference between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is determined as a target distortion difference, said target bit rate determination part calculates said target bit rate (T_(n+1)) of said second stream in the (N+1) period by multiplying said target bit rate (T) of said whole second stream by said target distortion difference.
 6. A transcoder for converting a first stream into a second stream, comprising: a part for setting a target bit rate (T) of a whole second stream; and a target bit rate determination part for determining a target bit rate (T_(n+1)) of a second stream in the (N+1) period on the basis of said target bit rate (T) of said whole second stream, a bit rate and a distortion evaluation value of a second stream converted in or before the N period, a target bit rate (T_(n)) of a second stream in the N period, and a target distortion evaluation value (d) of said whole second stream.
 7. The transcoder according to claim 6, wherein assuming that a value calculated out on the basis of a ratio between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is determined as a target distortion ratio and a value calculated out on the basis of a ratio between said bit rate of said second stream converted in or before the N period and said target bit rate (T_(n)) of said second stream in the n period is determined as a target rate ratio, said target bit rate determination part calculates said target bit rate (T_(n+1)) of said second stream in the (N+1) period by multiplying a ratio between said target bit rate (T) of said whole second stream and said target rate ratio by said target distortion ratio.
 8. The transcoder according to claim 7, wherein assuming that a ratio between a bit rate (S_(n−1)) of a first stream in the (N−1) period and a bit rate (S_(n)) of a first stream in the N period is determined as a period ratio, said target bit rate determination part adjusts said target rate ratio by multiplying said target rate ratio by said period ratio.
 9. The transcoder according to claim 7, wherein assuming that a ratio between a bit rate (S_(n)) of a first stream in the N period and a bit rate (S_(n+1)) of a first stream in the (N+1) period is determined as a period ratio, said target bit rate determination part adjusts said target rate ratio by multiplying said target rate ratio by said period ratio.
 10. The transcoder according to claim 7, wherein assuming that a ratio between an average bit rate (AS_(n−1)) of first streams in a plurality of past periods including the (N−1) period and an average bit rate (AS_(n)) of first streams in a plurality of past periods including the N period is determined as a period ratio, said target bit rate determination part adjusts said target rate ratio by multiplying said target rate ratio by said period ratio.
 11. The transcoder according to claim 7, wherein assuming that a ratio between an average bit rate (AS_(n)) of first streams in a plurality of past periods including the N period and an average bit rate (AS_(n+1)) of first streams in a plurality of past periods including the (N+1) period is determined as a period ratio, said target bit rate determination part adjusts said target rate ratio by multiplying said target rate ratio by said period ratio.
 12. The transcoder according to claim 6, wherein assuming that a value calculated out on the basis of a ratio between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is determined as a target distortion ratio and a value calculated out on the basis of a difference between said target bit rate (T_(n)) of said second stream in the n period and said bit rate of said second stream converted in or before the N period is determined as a target rate difference, said target bit rate determination part calculates said target bit rate (T_(n+1)) of said second stream in the (N+1) period by adding a product of said target rate difference and said target distortion ratio to said target bit rate (T) of said whole second stream.
 13. The transcoder according to claim 12, wherein assuming that a ratio between a bit rate (S_(n)) of a first stream in the N period and a bit rate (S_(n−1)) of a first stream in the (N−1) period is determined as a period ratio, said target bit rate determination part adjusts said target rate difference by multiplying said target rate difference by said period ratio.
 14. The transcoder according to claim 12, wherein assuming that a ratio between a bit rate (S_(n+1)) of a first stream in the (N+1) period and a bit rate (S_(n)) of a first stream in the N period is determined as a period ratio, said target bit rate determination part adjusts said target rate difference by multiplying said target rate difference by said period ratio.
 15. The transcoder according to claim 12, wherein assuming that a ratio between an average bit rate (AS_(n)) of first streams in a plurality of past periods including the N period and an average bit rate (AS_(n−1)) of first streams in a plurality of past periods including the (N−1) period is determined as a period ratio, said target bit rate determination part adjusts said target rate difference by multiplying said target rate difference by said period ratio.
 16. The transcoder according to claim 12, wherein assuming that a ratio between an average bit rate (AS_(n+1)) of first streams in a plurality of past periods including the (N+1) period and an average bit rate (AS_(n)) of first streams in a plurality of past periods including the N period is determined as a period ratio, said target bit rate determination part adjusts said target rate difference by multiplying said target rate difference by said period ratio.
 17. The transcoder according to claim 6, wherein a bit rate (C_(n)) of said second stream in the N period is used as said bit rate of said second stream converted in or before the N period.
 18. The transcoder according to claim 6, wherein an average bit rate (AC_(n)) of second streams in a plurality of past periods including the N period is used as said bit rate of said second stream converted in or before the N period.
 19. The transcoder according to claim 1, wherein a distortion evaluation value (d_(n)) of said second stream in the N period is used as said distortion evaluation value of said second stream converted in or before the N period.
 20. The transcoder according to claim 1, wherein an average distortion evaluation value (Ad_(n)) of second streams in a plurality of past periods including the N period is used as said distortion evaluation value of said second stream converted in or before the N period.
 21. The transcoder according to claim 1, wherein said distortion evaluation value of said second stream is calculated on the basis of a sum of squared differences of an image obtained by decoding said first stream and an image obtained by decoding said second stream.
 22. The transcoder according to claim 1, wherein said distortion evaluation value of said second stream is calculated on the basis of a sum of absolute differences of an image obtained by decoding said first stream and an image obtained by decoding said second stream.
 23. The transcoder according to claim 1, further comprising a part for calculating a quantization step conversion factor (a_(n+1)) in the (N+1) period on the basis of a ratio between said target bit rate (T_(n+1)) of said second stream in the (N+1) period which is determined by said target bit rate determination part and a bit rate of a first stream in or before the N period; and a part for calculating a quantization step value of said second stream in the (N+1) period by multiplying a quantization step value of a first stream in the (N+1) period by said quantization step conversion factor (α_(n+1)) in the (N+1) period.
 24. The transcoder according to claim 1, further comprising a part for calculating a quantization step conversion factor (α_(n+1)) in the (N+1) period on the basis of a ratio between said target bit rate (T_(n+1)) of said second stream in the (N+1) period which is determined by said target bit rate determination part and a bit rate of a first stream in or before the (N+1) period; and a part for calculating a quantization step value of said second stream in the (N+1) period by multiplying a quantization step value of a first stream in the (N+1) period by said quantization step conversion factor (α_(n+1)) in the (N+1) period.
 25. The transcoder according to claim 2, wherein the rate of change in said calculated target distortion ratio is asymmetric with respect to a boundary where said ratio between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is
 1. 26. The transcoder according to claim 4, wherein the rate of change in said calculated target distortion difference is asymmetric with respect to a boundary where said difference between said distortion evaluation value of said second stream converted in or before the N period and said target distortion evaluation value (d) of said whole second stream is
 0. 27. The transcoder according to claim 21, wherein a local decoder in an encoder is used as a processing part for decoding said second stream in order to calculate said distortion evaluation value of said second stream.
 28. The transcoder according to claim 22, wherein a local decoder in an encoder is used as a processing part for decoding said second stream in order to calculate said distortion evaluation value of said second stream. 